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Description 

Field of the Invention 

The present invention relates to a polyester synthase gene, a recombinant vector containing the gene, a transform- 
ant carrying the recombinant vector, and a process for producing polyester by use of the transformant 

Background of the Invention 

It is known that a large number of microorganisms biosynthesize poly-3-hydroxybutyrate (P(3HB)) and store it in 
the form of ultra! ine particles as an energy source in the body. P(3HB) extracted from microorganisms is a thermoplastic 
polymer with a melting temperature of about 180 -C. and because of its excellent biodegradability and biocompatibihty 
it is drawing attention as "green" plastic for preservation of the environment. Further, P(3HB) is "green* plastic which 
can be synthesized from regenerate carbon resources including sugars and vegetable oils by various microorganisms. 
However, P(3HB) is a highly crystalline polymer and thus has the problem in physical properties of inferior resistance 
to impact, so its practical application has never been attempted. 

Recently, polyester P(3HB-co-3HH) as a random copolymer of 3-hydroxybutyrate (3HB) and 3-hydroxyhexanoate 
(3HH) and a process for producing the same have been studied and developed, and these are described in e.g. Japa- 
nese Patent Laid Open Publication Nos, 93049/1993 and 265065/1995 respectively. In these publications, the P(3HB- 
co-3HH) copolymer is produced from alkanoic acids or olive oil by fermentation with Aeromonas eaviae. isolated from 
soil It is revealed that because the degree of crystallinity of the P(3HB-co-3HH) copolymer produced through fermen- 
tation is reduced with an increasing ratio of the 3HH unit in it. so that the copolymer becomes a soft polymeric material 
excellent in thermostability and formability and can be manufactured into strong yarn or transparent flexible Mm (Y. Doi, 
S Kitamura, H. Abe. Macromolecules 28, 4822-4823 (1995)). However, the yield of polyester (content of polyester in 
'dried microorganisms) according to the processes described in Japanese Patent Laid Open Publication Nos. 
93049/1993 and 265065/1995 is low, and thus there is demand for developments in a process for producing the copo- 
tymehzed polyester P(3HB-co-3HH). 

Summary of the Invention 

The object of the present invention is to provide a polyester synthase gene, recombinant vectors containing the 
gene, transformants transformed with the recombinant vectors, and processes for producing polyester by use of the 
transformants. . ... 

As a result of their eager research, the present inventors succeeded in producing the polyester in high yieW by clon- 
ing a polyester synthase gene and deleting one or both of open reading frames located upstream and downstream of 
said gene to anive at the completion of the present invention. 

That is, the present invention is a polyester synthase gene coding for a polypeptide containing the ammo acid 
sequence of SEQ ID NO:2 or a sequence where in said amino acid sequence, one or more amino acids are deleted, 
replaced or added, said polypeptide bringing about polyester synthase activity. Said gene includes those containing e.g. 
the nucleotide sequence of SEQ ID NO:1 

Further, the present invention is a gene expression cassette comprising said polyester synthase gene and either of 
open reading frames located upstream and downstream of said gene. In said gene expression cassette, the open read- 
ing frame located upstream of the polyester synthase gene includes those (e.g. SEQ ID NO:3) containing DNA coding 
for the amino acid sequence of SEQ ID NO:4, and the open reading frame located downstream of the polyester syn- 
thase gene includes those (e.g. SEQ ID NO:5) containing DNA coding for a polypeptide containing the amino acid 
sequence of SEQ ID NO:6 or a sequence where in said amino acid sequence, one or more amino acids are deleted, 
replaced or added, said polypeptide bringing about enoyl-CoA hydratase activity. 

Even if one or more amino acids in the amino acid sequence of SEQ ID NO:2 have undergone mutations such as 
deletion replacement, addition etc.. DNA coding for a polypeptide containing said amino acid sequence is also con- 
tained in the gene of the present invention insofar as the polypeptide has polyester synthase activity. For example, DNA 
coding for the amino acid sequence of SEQ ID NO:2 where methionine at the first position is deleted is also contained 
in the gene of the present invention. 

Further, the present invention is recombinant vectors comprising said polyester synthase gene or said gene 
expression cassette. 

Further, the present invention is transformants transformed with said recombinant vectors. 

Further, the present invention is processes for producing polyester, wherein said transformant is cultured in a 
medium, and polyester is recovered from the resulting culture. Examples of such polyester are copolymers (e.g. poly(3- 
hydroxybutyrate-co-3-hydroxyhexanoate) random copolymers) of 3-hydroxyalkanoic acid represented by formula I: 
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I - 

HO — CH - CH 2 — COOH 



(I) 



w wherein R represents a hydrogen atom or a C1 to C4 alkyf group. 

Brief Description of the Drawing 

FIG. 1 shows the structure of the gene of the present invention. 
is FIG. 2 is a photograph showing the result of SDS-polyacrylamide gel electrophoresis. 

Detailed Description of the Invention 

Hereinafter, the present invention is described in detail. 

20 

(1) Cloning of Polyester synthase gene 

The polyester synthase gene of the present invention is separated from a microorganism belonging to the genus 
Aeromonas . 

25 First, genomic DNA is isolated from a strain having the polyester synthase gene. Such a strain includes e.g. Aerom- 
onas caviae . 

Any known methods can be used for preparation of genomic DNA. For example, Aeromonas caviae is cultured in 
LB medium and then its genomic DNA is prepared by the hexadecyl trimethyt ammonium bromide method (Current Pro- 
tocols in Molecular Biology, vol. 1 , page 2.4.3.. John Wiley & Sons Inc., 1 994). 

30 The DNA obtained in this manner is partially digested with a suitable restriction enzyme (e.g. Sau3AI. BamHI, Bglll 
etc.) and then the DNA fragments are then dephosphorylated by treatment with alkaline phosphatase. It is ligated into 
a vector previously cleaved with a restriction enzyme {e.g. BamHI, Bglll etc.) to prepare a library. 

Phage or plasmid capable of autonomously replicating in host microorganisms is used as the vector. The phage 
vector includes e.g. EMBL3, M13, X gt1 1 etc., and the plasmid vector includes e.g. pBR322, pUC18, and pBluescript II 

35 (Stratagene). Vectors capable of autonomously replicating in 2 or more host cells 6uch as £. coJi and Bacillus brevis. as 
well as various shuttle vectors, can also be used. Such vectors are also cleaved with said restriction enzymes so that 
their fragment can be obtained. 

Conventional DNA ligase is used to ligate the resulting DNA fragments into the vector fragment. The DNA frag- 
ments and the vector fragment are annealed and then ligated to produce a recombinant vector. 

40 To introduce the recombinant vector into a host microorganism, any known methods can be used. For example, if 
the host microorganism is E. coJi, the calcium method (Lederberg. E.M. et al., J. Bacterid. 119, 1072 (1974)) and the 
electroporation method (Current Protocols in Molecular Biology, vol. 1. page 1.8.4 (1994)) can be used. If phage DNA 
is used, the in vitro packaging method (Current Protocols in Molecular Biology, vol. 1 „ page 5.7.1 (1994)) etc. can be 
adopted. In the present invention, an in vitro packaging kit (Gigapack II, produced by Stratagene etc.) can also be used. 

45 To obtain a DNA fragment containing the polyester synthase gene derived from Aeromonas caviae. a probe is then 
prepared. The amino acid sequences of some polyester synthase have already been known (Peoples, O.P and Sins- 
key. A.J.. J. Biol. Chem., 264, 15293 (1989); Huisman, G.W. etal., J. Biol. Chem.. gfifi, 2191 (1991); Pieper. U. et al.. 
FEMS Microbiol. Lett.. 96. 73 (1992) etc.). Two conserved regions are selected from these amino acid sequences, and 
nucleotide sequences coding them are estimated to design oligonucleotides for use as primers. Examples of such oli- 

so gonucleotides include, but are not limited to, the 2 oligonucleotides 5'-CC(C/G)CC(C/G)TGGATCAA(T/C)AAGT 
(T/A)(T/C)TA(T/C)ATC-3* (SEQ ID NO:7) and 5-(G/C)AGCCA (G/C)GC{G/C)GTCCA(A/G)TC(G/C)GGCCACCA-3 , 
(SEQ ID NO:8). 

Polymerase chain reaction (PCR) (Molecular Cloning, vol. 2. page 14.2 (1989)) is carried out using these oligonu- 
cleotides as primers and the genomic DNA of Aeromonas caviae as a template. The partial fragment of polyester syn- 
55 thase gene is amplified by PCR. 

Then, the partially amplified fragment thus obtained is labeled with a suitable reagent and used for colony hybridi- 
zation of the above genomic DNA library (Current Protocols in Molecular Biology, vol. 1, page 6.0.3 (1994)). 

The £. coii is screened by colony hybridization, and a plasmid is recovered from it using the alkaline method (Cur- 
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rent Protocols in Molecular Biology, vol. 1 , page 1 .6.1 (1994)). whereby a DNA fragment containing the polyester syn- 
thase gene is obtained. 

The nucleotide sequence of said DNA fragment can be determined in e.g. an automatic nucleotide sequence ana- 
lyzer such as 373A DNA sequencer (Applied Biosystems) using a known method such as the Sanger method (Molec- 
5 ular Cloning, vol. 2, page 13.3 (1989)). 

The nucleotide sequence of the polyester synthase gene of the present invention is shown in SEQ ID NO:1, and 
the amino acid sequence encoded by said gene is shown in SEQ ID NO:2, where some amino acids may have under- 
gone mutations such as deletion, replacement, addition etc. insofar as a polypeptide having said amino acid sequence 
brings about polyester synthase activity. Further, the gene of the present invention encompasses not only the nucle- 
io otide sequence coding for the amino acid sequence of SEQ ID NO:2 but also its degenerated isomers which except for 
degeneracy codons, code for the same polypeptide. 

The above mutations such as deletion etc. can be induced by known site-directed mutagenesis (Current Protocols 
in Molecular Biology, vol., 1 , page 8.1.1 (1994)). 

After the nucleotide sequence was determined by the means described above, the gene of the present invention 
is can be obtained by chemical synthesis or the PCR technique using genomic DNA as a template, or by hybridization 
using a DNA fragment having said nucleotide sequence as a probe. 

(2) Preparation of Transformant 

20 The transformant of the present invention is obtained by introducing the recombinant vector of the present invention 

into a host compatible with the expression vector used in constructing said recombinant vector. 

The host is not particularly limited insofar as it can express the target gene. Examples are bacteria such as micro- 
organisms belonging to the genus Alcaligenes . microorganisms belonging to the genus Pseudomonas. microorgan- 
isms belonging to the genus pacillus . yeasts such as the genera Saccharomvces. Candida etc., and animal cells such 

25 as COS cells, CHO cells etc. 

If bacteria such as microorganisms belonging to the genus Alcaligenes . microorganisms belonging to the genus 
Pseudomonas etc. are used as the host, the recombinant DNA of the present invention is preferably constituted such 
that it contains a promoter, the DNA of the present invention, and a transcription termination sequence so as to be capa- 
ble of autonomous replication in the host. The expression vector includes pLA2917 (ATCC 37355) containing replication 

30 origin RK2 and pJRD215 (ATCC 37533) containing replication origin RSF1010, which are replicated and maintained in 
a broad range of hosts. 

The promoter may be any one if it can be expressed in the host. Examples are promoters derived from E. cpJL 
phage etc., such as trp promoter, lac promoter, P L promoter, P R promoter and T7 promoter. The method of introducing 
the recombinant DNA into bacteria includes e.g. a method using calcium ions (Current Protocols in Molecular Biology. 
35 vol. 1. page 1.8.1 (1994)) and the electroporation method (Current Protocols in Molecular Biology, vol. 1, page 1.8.4 
(1994)). " 

If yeast is used as the host, expression vectors such as YEp13, YCp50 etc. are used. The promoter includes e.g. 
gal 1 promoter, gal 10 promoter etc. To method of introducing the recombinant DNA into yeast includes e.g. the elec- 
troporation method (Methods. Enzymoi., 194, 182-187 (1990)), the spheroplast method (Proc. Natl. Acad. Sci. USA, §4. 
40 1929-1933 (1978)). the lithium acetate method (J. Bacterid.. 15& 163-168 (1983)) etc. 

If animal cells are used as the host, expression vectors such as pcDNAI, pcDNAI/Amp (produced by Invitrogene) 
etc. are used. The method of introducing the recombinant DNA into animal cells includes e.g. the electroporation 
method, potassium phosphate method etc. 

The nucleotide sequence determined as described above contains the polyester synthase gene as wen as a plu- 
45 rality of open reading frames (ORFs) upstream and downstream of it. That is, the polyester synthase gene forms an 
operon with at least 2 ORF's under the control of a single promoter region. 

The ORF's which are located respectively upstream and downstream of the polyester synthase gene are referred 
to hereinafter as "ORFT and "ORF3". 

It is considered that ORF1 is an open reading frame of a gene involved in accumulating polyester in the microor- 
60 ganism or a gene in the polyester biosynthesis system. It was revealed that ORF3 is an open reading frame of a gene 
coding for enoyl-CoA hydratase (particularly (R)-specif ic enoyl-CoA hydratase) involved in biosynthesis of polyester. 

As shown in FIG. 1. an EcoRI fragment carrying an expression regulatory region (expressed as "-35/-10" in FIG. 
1 A), the polyester synthase gene. ORF1. and ORF3 was cloned in the present invention (FIG. 1A). This fragment is 
designated EE32. 

55 Then, a fragment (a gene expression cassette) is prepared by deleting ORF1 and/or ORF3 from EE32, and this 
cassette is introduced into a host whereby a transformant capable of efficiently producing polyester can be obtained. 

In EE32, a restriction enzyme Bglll sites are introduced into regions between the expression regulatory region and 
the translation initiation codon of ORF1 and between the translation termination codon of ORF1 and the translation ini- 
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tiation codon of the polyester synthase gene, and then ORF1 is deleted from EE32 by treatment with BglM (FIG. 1B). 
Similarly, a restriction enzyme BamHI sites is introduced into a region between the translation termination codon of the 
polyester synthase gene and ORF3. and then ORF3 is deleted by treatment with BamHI (FIG. 1C). 

To delete both ORFl and ORF3. EE32 may be subjected to the above operation of deleting ORF1 and ORF3 (FIG 

1D). 

The restriction enzyme sites can be introduced by site-directed mutagenesis using synthetic oligonucleotides (Cur- 
rent Protocols in Molecular Biology, voJ. 1 , page 8.1 .1 (1994)). 

Each gene expression cassette thus obtained is inserted into said plasmid capable of expression (e g. pJRD215 
(ATCC 37533)) and the resulting recombinant vector is used to transform Alcalipenes eutrophus PHB-4 (DSM541) 
(strain deficient in the ability to synthesize polyester). The method for this transformation includes e.g. the calcium chlo- 
ride method, rubidium chloride method, low pH method, in vitro packaging method, conjugation transfer method etc. 

(3) Production of Polyester 

The production of polyester is carried out by culturing the transformant of the present invention in a medium, form- 
ing and accumulating the polyester of the present invention in the microorganism or in the culture, and recovering the 
polyester from the cultured microorganism or from the culture. 

A conventional method used for culturing the host is also used to culture the transformant of the present invention. 

The medium for the transformant prepared from a microorganism belonging to the genus Alcaligenes or Pseu- 
domonas as the host include a medium containing a carbon source assimilable by the microorganism, in which a nitro- 
gen source, inorganic salts or another organic nutrition source has been limited, for example a medium in which the 
nutrition source has been limited to 0.01 to 0. 1 %. 

The carbon source is necessary for growth of the microorganism, and it is simultaneously a starting material of pol- 
yester. Examples are hydrocarbons such as glucose, fructose, sucrose, maltose eta Further, fat and oil related sub- 
stances having 2 or more carbon atoms can be used as the carbon source. The fat and oil related substances include 
natural tats and oils, such as corn oil, soybean oil, saff lower oil, sunflower oil, olive oil, coconut oil, palm oil, rape oil, fish 
oil, whale oil, porcine oil and cattle oil, aliphatic acids such as acetic acid, propionic acid, butanoic acid, pentanoic acid, 
hexoic acid, octanoic acid, decanoic acid, lauric acid, oleic acid, palmitic acid, linolenic acid, linolic acid and myristic 
acid as well as esters thereof, alcohols such as ethanol, propanol, butanol, pentanol. hexanol. octanol, lauryl alcohol, 
oleyf alcohol and palmrtyl alcohol as weD as esters thereof. 

The nitrogen source includes e.g. ammonia, ammonium salts such as ammonium chloride, ammonium sulfate, 
ammonium phosphate etc., peptone, meat extract, yeast extract, corn steep liquor etc The inorganic matter includes 
e.g. monopotassium phosphate, dipotassium phosphate, magnesium phosphate, magnesium sulfate, sodium chloride 
etc. 

Culture is carried out usually under aerobic conditions with shaking at 25 to 37 °C for more than 24 hours (e.g. 1 to 
7 days) after expression is induced. During culture, antibiotics such as ampicillin, kanamycin, arrtipyrine, tetracycline 
etc. may be added to the culture. Polyester is accumulated in the microorganism by culturing it, and the polyester is then 
recovered. 

To culture the microorganism transformed with the expression vector using an inducible promoter, its inducer can 
also be added to the medium. For example. isopropyt-p-D-thiogalactopyranoside (IPTG), indoleacrylic acid (IAA) etc. 
can be added to the medium. 

To culture the transformant from animal cells as the host, use is made of a medium such as RPMI-1640 or DMEM 
which may be supplemented with fetal bovine serum. Culture is carried out usually in the presence of 5 % C0 2 at 30 to 
37°C for 14 to 28 days. During culture, antibiotics such as kanamycin, penicillin etc. may be added to the medium. 

In the present invention, purification of polyester can be carried out e.g. as follows: 

The transformant is recovered from the culture by centrifugation, then washed with distilled water and dried. There- 
after, the dried transformant is suspended in chloroform and heated to extract polyester from it. The residues are 
removed by filtration. Methanol is added to this chloroform solution to precipitate polyester. After the supernatant is 
removed by filtration or centrifugation, the precipitates are dried to give purified polyester. 

The resulting polyester is confirmed to be the desired one in a usual manner e.g. by gas chromatography, nuclear 
magnetic resonance etc. 

The gene of the present invention contains the polyester synthase gene isolated from Aeromonas eayjas. This syn- 
thase can synthesize a copolymer (polyester) consisting of a monomer unit 3-hydroxyalkanoic acid represented by for- 
mula I: 
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R 

I <*> 
HO — CH - CH 2 — COOH 

wherein R represents a hydrogen atom or a CI to C4 alkyl group. Said copolymer includes e.g. poly(3-hydroxybutyrate- 
co-3-hydroxyhexanoate) random copolymer (P(3HB-co-3HH)) etc. and the transformant carrying said polyester syn- 
thase gene has the ability to produce P(3HB-co-3HH) with very high efficiency. 

Conventionally, a process for producing poly-3-hydroxybutyrate (P(3HB)) or poly(3-hydroxybutyrate-co-3 -hydroxy- 
valerate) random copolymer F(3HB-co-3HV) has been studied and developed, but such polyester has the problem in 
physical properties of inferior resistance to impact because it is a highly crystalline polymer. 
is Because degree of crystallinity i6 lowered by introducing 3-hydroxyhexanoate having 6 carbon atoms into a poly- 
mer chain, polyester acts as a flexible polymeric material which is also excellent in thermostability and formability, but 
conventional processes for producing P(3HB-co-3HH) by use of Aeromonas caviqe (Japanese Patent Laid Open Pub- 
lication Nos. 93049/1993 and 265065/1995) surfer from a low yield ol polyester. 

In the present invention, the P(3HB-co-3HH) copolyester can be produced in high yield. 
20 Because the desired polyester can be obtained in a large amount using the above means, it can be used as a bio- 
degradable material of yarn or film, various vessels etc. Further, the gene of the present invention can be used to breed 
a strain highly producing the P(3HB-co-3HH) copolymer polyester. 

Examples 

25 

Hereinafter, the present invention is described in more detail with reference to the Examples which however are not 
intended to limit the scope of the present invention. (Example 1] Cloning of the Polyester synthase Gene from Aerom- 
onas caviae 

First, a genomic DNA library was prepared from Aeromonas cayiae. 
30 Aeromonas caviae FA440 was cultured overnight in 100 ml LB medium (1 % yeast extract. 0.5 % trypton, 0.5 % 
sodium chloride, 0.1 % glucose, pH 7.5) at 30 °C and then genomic DNA was obtained from the microorganism using 
the hexadecyl trimethyt ammonium bromide method (Current Protocols in Molecular Biology, vol. 1, page 2.4.3 (1994), 
John Wiley & Sons Inc.). 

The resulting genomic DNA was partially digested with restriction enzyme Sau3AI. The vector plasmid used was 
35 cosmid vector pLA291 7 (ATCC 37355). 

This plasmid was cleaved with restriction enzyme Bglll and dephosphorylated (Molecular Cloning, vol. 1. page 
5.7.2 (1989), CoW Spring Harbor Laboratory) and then ligated into the partially digested genomic DNA fragment by use 
of DNA ligase. 

E- coJl S1 7- 1 was transformed with this ligated DNA fragment by the in yjiro packaging method (Current Protocols 
40 in Molecular Biology, vol. 1, page 5.7.2 (1994)) whereby a genomic DNA library from Aeromonas cayiae was obtained. 
To obtain a DNA fragment containing the polyester synthase gene from Aeromonas caviae, a probe was then pre- 
pared. Two well conserved regions were selected from known amino acid sequences of several polyester synthases, 
and nucleotide sequences coding for them were estimated, and 2 oligonucleotides 5'-CC(C/G)CC(C/G)TGGAT- 
CAA(T/C)AAGT (T/A)(T/C) TA(T/C)ATC-3' (SEQ ID NO:7) and 5'- 

45 (G/C)AGCCA(G/C)GC(G/C)GTCCA(A/G)TC(G/C)GGCCACCA-3' (SEQ ID NO:8) were synthesized. 

The polyester synthase gene was partially amplified by PCR using these oligonucleotides as primers and the 
genomic DNA from Aeromonas caviae as a template. PCR was carried out using 30 cycles, each consisting of reaction 
at 94 °C for 30 seconds, 50 °C for 30 seconds, and 72 °C for 60 seconds. 

Then, this partially amplified fragment was labeled with digoxigenin using a DIG DNA labeling kit (Boehringer Man- 
60 nheim) and used as a probe. 

Using the probe thus obtained, E. coll carrying a plasmid containing the polyester synthase gene was isolated by 
colony hybridization from the genomic DNA library from Aero mona s cayjae. By recovering the plasmid from the E COL 
a DNA fragment containing the polyester synthase gene was obtained. 

The nucleotide sequence of a 3.2 kbp Bgill-EcoRl fragment from this fragment was determined by the Sanger 
55 method. 

As a result, the nucleotide sequence of the 3.2 kb fragment as shown in SEQ ID NOs:9 or 10 was determined. 
By further examining homology to this nucleotide sequence, the polyester synthase gene containing the nucleotide 
sequence (1785 bp) of SEQ ID NO:1 could be identified in this 3.2 kbp nucleotide sequence. 
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h should be understood that insofar as the protein encoded by the polyester synthase gene of the present invention 
has the function of gene expression for polyester polymerization, the nucleotide sequence of said gene may have 
undergone mutations such as deletion, replacement, addition etc. 

In a fragment having the nucleotide sequence of SEQ ID NO:9 or 10. a 405 bp gene (ORF3) and a transcription 
termination region located downstream of the above 1785 bp nucleotide sequence, as well as a 354 bp gene (ORFl) 
and an expression regulatory region located upstream thereof were identified. The nucleotide sequence of ORF1 is 
shown in SEQ ID NO:4; the nucleotide sequence of ORF3 in SEQ ID NO:5; and the amino acid sequence encoded by 
ORF3 in SEQ ID NO: 6. 

ORF3 is an open reading frame of a gene coding for enoyl-CoA hydratase involved in biosynthesis of polyester. 
Insofar as a polypeptide having the amino acid sequence encoded by ORF3 has enoyl-CoA hydratase activity, particu- 
larly (R)-speafic enoyl-CoA hydratase activity, said amino acid sequence may have undergone mutations such as dele- 
tion, replacement and addition of one or more amino acids. 

In the nucleotide sequences of SEQ ID NOS:9 and 10, the expression regulatory region is located at the 1- to 383- 
posrtions and the transcription termination region at the 3010 to 3187- positions. 

[Example 2] Preparation of Alcaligenes eutrophus Transformant 

The Bglll site of the Bglll-EcoRI fragment containing this expression regulatory region, ORF1, the polyester syn- 
thase gene, ORF3, and the transcriptional termination region was made EcoRI-ended by use of an EcoRI linker 
whereby a 3.2 to EcoRI-EcoRI fragment (EE32 fragment) was obtained. This fragment was inserted into plasmid 
pJRD215 (ATCC 37533) capable of expression in microorganisms belonging to the genus Alcaligenes. and the result- 
ing recombinant plasmid was transformed into Alcaligenes eutroohus PHB-4 (DSM 541) (strain deficient in the ability 
to synthesize polyester) by the conjugation transfer method, as follows: 

First, the recombinant plasmid was used to transform E.cpji S17-1 by the calcium chloride method. The recom- 
binant E.coJI thus obtained and Alcaligenes eutroohus PHB-4 were cultured overnight in 1.5 ml LB medium at 30 °C. 
and the respective cultures, each 0.1 ml, were combined and cultured at 30 °C for 4 hours. This microbial mbrture was 
plated on MBF agar medium (0.9 % disodium phosphate, 0. 1 5 % monopotassium phosphate, 0.05 % ammonium chlo- 
ride, 0.5 % fructose, 1 .5 % agar, 0.3 mg/ml kanamycin) and cultured at 30 °C for 5 days. 

Because Alcaligenes eutrophus PHB-4 is rendered resistant to kanamycin by transferring the plasmid in the recom- 
binant E. cpU into it. the colonies grown on the MBF agar medium are a transformant of Alcaligenes eutroohus . One 
colony was isolated from these colonies so that Alcaligenes eutrophus AC32 (referred to hereinafter as AC32) was 
obtained. 

AC32 has been deposited as FERM BP-6038 with the National Institute of Bioscience and Human-Technology, 
Agency of Industrial Science and Technology, Japan. 

A restriction enzyme Bglll sites were introduced respectively into regions upstream and downstream of the ORFt 
gene in the EE32 fragment by site-directed mutagenesis using a synthetic oligonucleotide (Cunent Protocols in Molec- 
ular Biology, vol. 1, page 8.1 .1 (1994)). and an ORF1 gene-free fragment was obtained by deleting the Bglll-Bglll frag- 
ment from the EE32 fragment and then inserted into plasmid pJRD215. The resulting recombinant plasmid was used 
to transform Alcaligenes eutrophus PHB-4 by the conjugation transfer method described above. The resulting trans- 
formant is referred to hereinafter as AC321 . 

Similarly, a restriction enzyme BamHI sites were introduced respectively regions upstream and downstream of the 
ORF3 gene in the EE32 fragment by site-directed mutagenesis, and an ORF3 gene-free fragment was obtained by 
deleting the BamHI-BamHI fragment from the EE32 fragment and then inserted into plasmid DJRD215. Tne resulting 
recombinant plasmid was used to transform Alcaligenes eutrophus PHB-4 by the conjugation transfer method 
described above. The resulting transformant is referred to hereinafter as AC323. 

Similarly, a restriction enzyme Bglll sites were introduced respectively regions upstream and downstream of the 
ORF1 gene and a restriction enzyme BamHI sites were introduced respectively regions upstream and downstream of 
the ORF3 gene in the EE32 fragment, and a gene fragment free of both the ORF1 and ORF3 genes was obtained by 
deleting the Bglll-Bglll and BamHI-BamHI fragments from the EE32 fragment and then inserted into plasmid DJRD215. 
The resulting recombinant plasmid was used to transform Alcaligenes eutrophus PHB-4 by the conjugation transfer 
method described above. The resulting transformant is referred to hereinafter as AC3213. 

Further, the polyester synthase gene was amplified by PCR using the EE32 fragment as a template, and the result- 
ing amplification product was inserted into a region between an expression regulatory region and a transcription termi- 
nation region in a known polyester biosynthesis operon derived from Alcaligenes eutroohus . PCR was carried out using 
5*-AGTTCCCGCCTCGGGTGTGGGTGAA-3* (SEQ ID NO: 11) and 5'-GGCATATGCGCTCATGCGGCGTCCT-3' (SEQ 
ID NO: 1 2) as primers in 30 cycles each consisting of reaction at 94 °C for 30 seconds, 55 °C for 30 seconds and 72 °C 
for 60 seconds. 

This DNA fragment was inserted into plasmid pJRD215. and the resulting plasmid was used to transform Alcali- 
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genes eutrophus PHB-4 by the conjugation transfer method described above. The resulting transformant is referred to 
hereinafter as AC29. 

[Example 3] Synthesis of Polyester by Alcaliqenes eutrophus Transformarrts 

Each of Alcalioenes eutrophus H16, PHB-4, AC32, AC321, AC323, AC3213 and AC29 was inoculated into 95 ml 
MB medium (0.9 % disodium phosphate, 0.15 % monopotassium phosphate, 0.05 % ammonium chloride) containing 1 
ml of 1 % sodium octanate and incubated in a flask at 30 °C. 0.2 g/L kanamycin was contained in the mediums for 
strains AC32, AC321, AC323, AC3213 and AC29. 12, 24, 36 and 48 hours thereafter, 1 ml of 1 % sodium octanate was 
added to each medium (total amount of sodium octanate added: 0.5 g) and the microorganisms were cultured for 72 
hours. 

Each of strains H1 6 and AC3213 was inoculated into the above MB medium to which 1% olive oil, palm oil, corn oil 
or oleic acid had been added, and each strain was cultured at 30 °C for 72 hours in a flask. 0.2 g/L kanamycin was con- 
tained in the mediums for strain AC321 3. 

Each of strains H16, AC32, AC321, AC323 and AC3213 was inoculated into the above MB medium to which 1% 
sodium heptanoate had been added, and each strain was cultured at 30 °C in a flask. 0.2 g/L kanamycin was contained 
in the mediums for strains AC32, AC321, AC323 and AC3213. 

While 1 ml of 1% sodium heptanoate was added to each medium (total amount of sodium heptanoate added: 0.5 
g) 12, 24, 36 and 48 hours thereafter, the microorganisms were cultured for 72 hours. 444 

The microorganisms were recovered by centrifugation, washed with distilled water and lyophilized, and the weight 
of the dried microorganisms was determined. 2 mi sulfuric acid/methanol mixture (15 : 85) and 2 ml chloroform were 
added to 10-30 mg of the dried microorganism, and the sample was sealed and heated at 100 °C for 140 minutes 
whereby the polyester in the microorganisms was decomposed into methylester. 1 ml distilled water was added thereto 
and stirred vigorously. It was left and separated into 2 layers, and the lower organic layer was removed and analyzed 
for its components by capillary gas chromatography through a capillary column Neutra BOND-1 (column of 25 m in 
length, 0.25 mm in inner diameter and 0.4 pm in liquid film thickness, manufactured by GL Science) in Shimadzu GC- 
1 4A. The temperature was raised at a rate of 8 °C/min. from an initial temperature of 1 00 °C. The results are shown in 
Tables 1, 2 and 3. 



Table 1 



Synthesis of Polyester Using Octanoic Acid as Carbon Source 


Strain Used A. eutrophus 


Weight of Dried Microor- 
ganism (g/l) 


Content of Polyester in 
Dried Microorganism 
(weight-%) 


Polyester Comp. 








3HB 


3HH 








(mole-%) 


H16 


3.00 


86 


100 


0 


PHB-4 


0.80 


0 






AC32 


0.99 


33 


78 


23 


AC321 


2.85 


92 


87 


13 


AC323 


2.85 


92 


88 


12 


AC3213 


3.64 


96 


85 


15 


AC29 


3.20 


94 


92 


8 


3HB: 3-hydroxybutyrate, 3HH: 3-hydroxyhexanoate 
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Table 2 



Synthesis of Polyester Using Vegetable Oil or Oleic Acid as Carbon Source 


Strain Used A, 
eutrophys 


Carbon Source 


Weight of Dried 
Microorganism (g/l) 


Content of Polyester in 
Dried Microorganism 
(weight-%) 


Polyester Comp. 










3HB 


3HH 










(mole-%) 


H16 


olive oil 


4.27 


79 


100 


0 




corn oil 


3.57 


81 


100 


0 




palm oil 


4.13 


79 


100 


0 




oleic acid 


4.06 


82 


100 


0 


AC3213 


olive oil 


3.54 


76 


96 


4 




com oil 


3.60 


77 


95 


5 




palm oil 


3.58 


81 


96 


4 




oleic acid 


2.22 


70 


96 


4 


3HB: 3-hydroxybutyrate, 3HH: 3-hydroxyhexanoate 



Table 3 



Synthesis of Polyester Using Heptanoic Acid as Carbon Source 


Strain Used A. eutroohus 


Weight of Dried Microor- 
ganism (g/l) 


Content of Polyester in 
Dried Microorganism 
(weight-%) 


Polyester Comp. 








3HB 


3HV 


3HHp 








(mole-%) 


H16 


2.50 


60 


50 


50 


0 


AC32 


0.77 


7 


30 


67 


5 


AC321 


1.67 


55 


46 


52 


2 


AC323 


1.27 


40 


48 


45 


7 


AC3213 


2.76 


67 


44 


48 


8 


3HB: 3-hydroxybutyrate, 3HH: 3-hydroxyhexanoate, 3HHp: 3 -hydroxy heptanoate 



As 6hown in Table 1. H16 (i.e. wild-type AJ call genes eutrophus) synthesized a poly(3-hydroxybutyrate) homopoly- 
mer. This is because 3HH (3-hydroxyhexanoate) having 6 carbon atoms does not serve as a substrate for the polyester 
synthase possessed by H1 6. PHB-4 (i.e. the same strain as H16 but deficient in the ability to synthesize polyester) lacks 
the polyester synthase and thus does not accumulate polyester. AC32 prepared by introducing into PHB-4 the EE32 
fragment containing the polyester synthase gene derived from Aeromonas caviae accumulated the poly(3 -hydroxy bur - 
ylate-co-3-hydroxyhexanoate) random copolymer (P(HB-co-3HH)) containing 22 mole-% 3HH (3-hydroxyhexanoate). 
and this copolymer accounted for 33 % by weight of the dried microorganism. 

AC321, AC323 and AC3213 accumulated P(3HB-co-3HH) containing 12 to 15 mole : % 3HH, and the copolymer 
accounted for 92 to 96 % by weight of the dried microorganisms. As can be seen from these results, the ability of these 
strains to accumulate polyester was significantly improved by deleting the ORF1 gene and/or ORF3 gene. 

P(3HB-co-3HH) was also accumulated in an amount of 94 % by weight of the microorganism even in the case of 
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AC29 carrying the polyester synthase gene derived from A. caviae whose expression regulatory region and transcrip- 
tional termination region had been replaced by those derived from Alcajig e nes . eutrophus, indicating that the yield of 
polyester was significantly improved even using the expression regulatory region and transcriptional termination region 
of different origin. 

5 When AC3213 producing polyester in the highest yield was cultured using olive oil. corn oil or palm oil as a carbon 

source, the microorganism accumulated P(3HB-co-3HH) containing 4 to 5 mole-% 3HH, where the copolymer 
accounted for 76 to 81 % by weight of the microorganism, as shown in Table 2. Even if oleic acid as an fatty acid com- 
ponent contained most abundantly in vegetable oils was used as a carbon source, AC3213 accumulated P(3HB-co- 
3HH) containing 4 mole-% 3HH, where the copolymer accounted for 70 % by weight of the microorganism. Its corre- 

w sponding wild strain H16 synthesized only poIy(3-hydroxybutyTate) homopolymer under the same conditions. 

Alcaligenes eutrophus FA440 is reported to have accumulated 8 % by weight of P(3HB-co-3HH) by use of palmitic 
acid as a carbon source (Japanese Patent Laid Open Publication No. 265065/1995). On the other hand, the transform- 
ant according to the present invention has accumulated 96 % by weight of P(3HB-co-3HH) by use of octanoic acid as 
a carbon source and 76 to 81 % by weight of P(3HB-co-3HH) by use of extremely cheap vegetable oils as a carbon 

is source, so the comparison therebetween indicates that the method of synthesizing P(3HB-co-3HH) by the transformant 
used in the present example is an extremely superior method. 

When heptanoic acid was used as a carbon source, H16, that is a wild strain of Alcalioenes eutrophus. synthesized 
poly(3-hydroxybutyrate-co-3-hydroxyvalerate) copolymer (P(3HB-co-3HV)). This is because 3HHp (3-hydroxyhep- 
tanoate) having 7 carbon atoms does not serve as a substrate for the polyester synthase possessed by H16, AC32, 

20 derived from PHB-4 by introduction of the EE32 fragment containing the polyester synthase gene derived from Aerom- 
onas caviae . accumulated pory(3-hydroxybutyrate-co-3-hydroxy\^lerate-co-3-hydroxyheptanoate) terpolymer (P(3HB- 
co-3HV-co-3HHp)) containing 5 mole-% 3HHp, where this copolymer accounted for 7 % by weight of the dried microor- 
ganism. 

Further, each of strains AC321 , AC323 and AC3213 accumulated P(3HB-co-3HV-co-3HHp) containing 2 to 8 mole- 
25 % 3HHp where the copolymer accounted for 40 to 67 % by weight of the microorganisms, indicating that the yield of 
polyester was significantly improved by deleting the ORF1 gene and/or ORF3 gene (Table 3). 

From these results, it is concluded that copolye6ters consisting of 3-hydroxyalkanoic acid with 4 to 7 carbon atoms 
can be synthesized using the polyester synthase derived from Aeromonas caviae. 

30 [Example 4] Identification of Functions of ORF3 

The ORF3 gene was amplif ied by PCR using the EE32 fragment as a template and then inserted into a sHe down- 
stream of T7 promoter in expression plasmid PET-3a (Novagene). PCR was canied out using S'-GCCATATGAGCG- 
CACAATCCCTGGAAGTAG-3' (SEQ ID NO:13) and 5'-CTGGGATCCGCCGGTGCTTAAGGCAGCTTG-3' (SEQ ID 
35 NO: 14) as primers in 25 cycles each consisting of reaction at 95 °C for 60 seconds and 68 °C for 30 seconds. The 
resulting plasmid was used to transform E. cpji BL21 (DE3) (Novagene). The resulting transformant is designated NB3. 

NB3 was cultured in LB medium at 30 °C for 4 hours, and isopropyl-p-D-thiogalactopyranoside (IPTG) was added 
at a final concentration of 0.4 mM to induce expression, and it was further cultured at 30 °C for 2 hours. The microor- 
ganism was recovered by centrrfugation, disrupted by ultrasonication and centrifuged to give a soluble protein fraction. 
40 As shown in Table 4, high enoyl-CoA hydratase activity was detected in the soluble fraction from the microorganism 
having the expression plasmid introduced into it. 

Table 4 



Specific Activity of Enoyl-CoA Hydratase 
in Soluble Protein Fraction 

_ (unit/mq protein) 

E. COli BL21/PET-3a 0 

E. coli NB3 X7Q0 



The enoyl-CoA hydratase activity was determined by measuring a change in absorbance (263 nm) due to double 
bond hydration, using crotonyl-CoA (Sigma) as substrate (concentration: 0.25 mM). No activity was detected in £. coli 
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into which the control plasmid PET- 3a free of the ORF3 gene had been introduced. 

Then, the enoyf-CoA hydratase protein was purified. A soluble protein fraction from NB3 was applied to an anion 
exchange column Q-Sepharose (Pharmacia) and eluted with a gradient of (0 to 1 M) NaCI, and a fraction with enoyi- 
CoA hydratase activity was collected. SDS-PAGE analysis indicated that the active fraction was homogenous in elec- 
5 trophoresis as shown in FIG. 2. In addition, about 3-fold specific activity could be attained as shown in Table 5. 

Table 5 

Specific Activity of Enoyl-CoA Hydratase 

10 

(unit/mg protein \ 

H . coli NB3 soluble protein fraction 17 00 
is anion eycfrangq column elutjon fraction 5100 



The N-terminal amino acid sequence of the enoyl-CoA hydratase protein thus purified was determined. As shown 
zo in Table 6, the determined amino acid sequence was the same except for Met in the initiation codon as the amino acid 
sequence deduced from the nucleotide sequence of the ORF3 gene. 

Table 6 

" s Comparison between Amino Acid Sequences 

( unit/mg prptQinl 

30 N-terminal amino acid sequence of 

purified enoyl-CoA hydratase: SAQSLEVGQKARLSKRFGAA (SEQ ID NO: 15) 
amino acid sequence deduced from 

ORF3 nucleotide sequence: MSAQSLEVQOKARLSKRFGAA ( SEQ ID NO: 16 ) 



40 From this, it could be confirmed that ORF3 codes for enoyl-CoA hydratase. It is considered that Met was released 
by post-translational modification. 

Further, the stereospecificity of enoyl-CoA hydratase encoded by ORF3 was examined as follows: 
By adding (S)-3-hydroxyt>utyryl-CoA dehydrogenase (Sigma) (final concentration: 0.2 unit/ml) and oxidized nicoti- 
namide adenine cSnucleotide (NAD+) (final concentration: 0.5 mM) to a reaction solution for activity measurement, (S)- 
45 3-hydroxybutyryl-CoA formed is oxidized to acetoacetyl-CoA by the action of the dehydrogenase if the enoyl-CoA 
hydratase is specific to the (S)-isomer. During this reaction. NAD+ is reduced to form NADH resulting in the generation 
of a specific absorption at 340 nm. H enoyt-CoA hydratase is specific to the (R)- isomer, NADH is not formed. 

As shown in Table 7, the change in absoruance at 340 nm when enoyl-CoA hydratase encoded by ORF3 was used, 
was the same as in the case where enoyl-CoA hydratase was absent, but if commercially available (S)-specrfic enoyl- 
50 CoA hydratase (Sigma) was used, a change in absorbance due to formation of NADH was observed. 
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Table 7 



Change in Absorbance at 340 nm after 1 Min. 


no addition of enoyl-CoA hydratase 




0.045 


ORF3-derived enoyl-CoA hydratase 




0.047 


(S)-isomer specific enoyl-CoA hydratase (Sigma) 




0.146 



10 



15 



From this result, it was made evident that the purified enoyl-CoA hydratase is specific to the (R)-isomer. Thus, it 
was found that ORF3 codes for (R)-isomer specific enoyl-CoA hydratase. 

According to the present invention, there are provided a polyester synthase, a recombinant vector carrying the 
gene, a transformant carrying the recombinant vector and a process for producing polyester by use of the transformant 

The present invention is extremely useful in that the present gene codes for a polyester synthase capable of syn- 
thesizing polyester as a copolymer consisting of a monomer unit represented by 3-hydroxyalkanoic acid having 4 to 7 
carbon atoms, and that the present process can synthesize a biodegradable plastic P(3HB-co-3HH) very efficiently 
which is excellent in thermostability and formability. 



20 



30 



35 



40 



46 



60 
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SEQUENCE LISTING 
(I) GENERAL INFORMATION: * 

5 

(i) APPLICANT: 

(A) NAME: THE INSTITUTE OP PHYSICAL AND CHEMICAL RESEARCH 

(B) STREET: Hirosawa 2-1 

(C) CITY: Wako-ahi 

(D) STATE: Saitaraa 
70 (E) COUNTRY: Japan 

(F) POSTAL CODE (ZIP) : 351 01 

(G) TELEPHONE: 81-48-467-9263 

(H) TELEPAX: 81-48-462-4609 



1S (ii) TITLE OF INVENTION: POLYESTER SYNTHASE GENE AND PROCESS POR PRODUCING 

POLYESTER 

(iii) NUMBER OF SEQUENCES: 16 

(iv) COMPUTER READABLE PORM: 

(A) MEDIUM TYPE: PlOppy disk 
20 (B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS - DOS 

(D) SOFTWARE: Patentln Release #1.0, VerBion &1.30 <EPO) 



25 



(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: 97113932.4 

<vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP 214509/1996 

(B) FILING DATE: 14-AUG1996 

(vi) PRIOR APPLICATION DATA: 
30 (A) APPLICATION NUMBER: JP 19 9979/1997 

<B) PILING DATE: 25-JUL-1997 



(2) INFORMATION POR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1785 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic} 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
(B> LOCATION. 1. . 1782 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



50 



ATG 


AOC 


CAA 


CCA 


TCT 


TAT 


GQC 


CCG 


CTG 


TTC 


GAG 


GCC 


CTG 


GCC CAC 


TAC 


48 


Met 


Ser 


Gin 


Pro 


Ser 


Tyr 


Gly 


Pro 


Leu 


Phe 


Glu 


Ala 


Leu 


Ala His 


Tyr 




i 








5 










10 








15 






AAT 


GAC 


AAG 


CTG 


CTG 


GCC 


ATG 


GCC 


AAG 


GCC 


CAG 


ACA 


GAG 


GGC ACC 


GCC 


96 


Asn 


Asp 


Lya 


Leu 


Leu 


Ala 


Met 


Ala 


Lys 


Ala 


Gin 


Thr 


Glu Arg Thr Ala 










20 










25 










30 






CAG 


GCG 


CTG 


CTG 


CAG 


ACC 


AAT 


CTG 


GAC 


GAT 


CTG 


GGC 


CAG 


GTG. CTG 


GAG 


144 


Gin 


Ala 


Leu 


Leu 


Gin 


Thr 


Asn 


Leu 


Asp 


Asp 


Leu 


Gly 


Gin 


Val Leu 


Glu 





13 



EP 0 824 148 A2 



CAG GGC AGC CAG CAA CCC TGG CAG CTG ATC CAG GCC CAG ATG AAC TGG 
Gin Gly Ser Gin Gin Pro Trp Gin Leu He Gin Ala Gin Met Asn Trp 

50 55 60 > 

TGG CAG GAT CAG CTC AAG CTG ATG CAG CAC ACC CTG CTC AAA AGC GCA 
Trp Gin Asp Gin Leu Lys Leu Met Gin His Thr Leu Leu Lys Ser Ala 
65 70 7 5 80 

GGC CAG CCG AGC GAG CCG GTG ATC ACC CCG GAG CGC AGC GAT CGC CGC 
Gly Gin Pro Ser Glu Pro Val lie Thr Pro Glu Arg Ser Asp Arg Arg 

85 90 95 

TTC AAG GCC GAG GCC TGG AGC GAA CAA CCC ATC TAT GAC TAC CTC AAG 
Phe Lys Ala Glu Ala Trp Ser Glu Gin Pro He Tyr Asp Tyr Leu Lys 

100 105 
CAG TCC TAC CTG CTC ACC GCC AGG CAC CTG CTG GCC TCG GTG GAT GCC 
Gin Ser Tyr Leu Leu Thr Ala Arg His Leu Leu Ala Ser Val Asp Ala 

115 120 125 

CTG GAG GGC GTC CCC CAG AAG AGC CGG GAG CGG CTG CGT TTC TTC ACC 
Leu Glu Gly Val Pro Gin Lys Ser Arg Glu Arg Leu Arg Phe Phe Thr 

130 135 140 

CGC CAG TAC GTC AAC GCC ATG GCC CCC AGC AAC TTC CTG GCC ACC AAC 
Arg Gin Tyr Val Asn Ala Met Ala Pro Ser Asn Phe Leu Ala Thr Asn 
145 150 155 160 

CCC GAG CTG CTC AAG CTG ACC CTG GAG TCC GAC GGC CAG AAC CTG GTG 
Pro Glu Leu Leu Lys Leu Thr Leu Glu Ser Asp Oly Gin Asn Leu Val 

165 HO l 7 ^ 

CGC GGA CTG GCC CTC TTG GCC GAG GAT CTG GAG CGC AGC GCC GAT CAG 
Arg Gly Leu Ala Leu Leu Ala Glu Asp Leu Glu Arg Ser Ala Asp Gin 

180 185 
CTC AAC ATC CGC CTG ACC GAC GAA TCC GCC TTC GAG CTC GGG CGG GAT 
Leu Asn He Arg Leu Thr Asp Glu Ser Ala Phe Glu Leu Gly Arg Asp 

19S 200 205 

CTG GCC CTG ACC CCG GGC CGG GTG GTG CAG CGC ACC GAG CTC TAT GAG 
Leu Ala Leu Thr Pro Gly Arg Val Val Gin Arg Thr Glu Leu Tyr Glu 

210 215 220 

CTC ATT CAG TAC AGC CCG ACT ACC GAG ACG GTG GGC AAG ACA CCT GTG 
Leu He Gin Tyr Ser Pro Thr Thr Glu Thr Val Gly Lys Thr Pro Val 
225 230 235 240 

CTG ATA GTG CCG CCC TTC ATC AAC AAG TAC TAC ATC ATG GAC ATG CGG 
Leu He val Pro Pro Phe He Asn Lys Tyr Tyr He Met Asp Met Arg 

245 250 255 

CCC CAG AAC TCC CTG GTC GCC TGG CTG GTC GCC CAG GGC CAG ACG GTA 
Pro Gin Asn Ser Leu Val Ala Trp Leu Val Ala Gin Gly Gin Thr Val 

260 265 270 

TTC ATG ATC TCC TGG CGC AAC CCG GGC GTG GCC CAG GCC CAA ATC GAT 
Phe Met lie Ser Trp Arg Asn Pro Gly Val Ala Gin Ala Gin He Asp 

275 280 285 

CTC GAC GAC TAC GTG GTG GAT GGC GTC ATC GCC GCC CTG GAC GGC GTG 
Leu Asp Asp Tyr val val Asp Gly Val He Ala Ala Leu Asp Gly Val 

290 295 300 

GAG GCG GCC ACC GGC GAG CGG GAG GTG CAC GGC ATC GGC TAC TGC ATC 
Glu Ala Ala Thr Gly Glu Arg Glu Val His Gly He Gly Tyr Cya He 
305 310 315 320 

GGC GGC ACC GCC CTG TCG CTC GCC ATG GGC TGG CTG GCG GCG CGG CGC 
Gly Gly Thr Ala Leu Ser Leu Ala Met Gly Trp Leu Ala Ala Arg Arg 

325 330 335 

CAG AAG CAG CGG GTG CGC ACC GCC ACC CTG TTC ACT ACC CTG CTG GAC 
Gin Lys Gin Arg val Arg Thr Ala Thr Leu Phe Thr Thr Leu Leu Asp 

340 345 350 

TTC TCC CAG CCC GGG GAG CTT GGC ATC TTC ATC CAC GAG CCC ATC ATA 
Phe Ser Gin Pro Gly Glu Leu Gly He Phe He His Glu Pro lie He 

355 360 365 

GCG GCG CTC GAG GCG CAA AAT GAG GCC AAG GGC ATC ATG GAC GGG CGC 
Ala Ala Leu Glu Ala Gin Asn Glu Ala Lys Gly He Met Asp Gly Arg 

370 375 380 

CAG CTG GCG GTC TCC TTC AGC CTG CTG CGG GAG AAC AGC CTC TAC TGG 



40 45 
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Gin Leu Ala Val Ser Phe Ser Leu Leu Arg Glu Asn 9er Leu Tyr Trp 
? 385 390 395 400 

AAC TAC TAC ATC GAC AGC TAC CTC AAG GGT CAG AGC CCG CTG GCC TTC • 12 4 8 

Asn Tyr Tyr lie Asp Ser Tyr Leu Lys Gly Gin Ser Pro Val Ala Phe? 
5 ~ 405 410 415 

GAT CTG CTG CAC TGG AAC AGC GAC AGC ACC AAT GTG GCG GGC AAG ACC 1296 

Asp Leu Leu His Trp Aen Ser Asp Ser Thr Aan Val Ala Gly Lys Thr 

420 425 430 

CAC AAC AGC CTG CTG CGC CGT CTC TAC CTG GAG AAC CAG CTG GTG AAG 1344 

His Asn Ser Leu Leu Arg Arg Leu Tyr Leu Glu Asn Gin Leu Val Lys 
10 435 440 445 

GGG GAG CTC AAG ATC CGC AAC ACC CGC ATC GAT CTC GGC AAG GTG AAG 1392 

Gly Glu Leu Lys lie Arg Asn Thr Arg lie Asp Leu Gly Lys Val Lys 

450 455 460 

ACC CCT GTG CTG CTG GTG TCG GCG GTG GAC GAT CAC ATC GCC CTC TGG 1440 

Thr Pro Val Leu Leu Val Ser Ala val Asp Asp Hib lie Ala Leu Trp 
465 470 475 480 

CAG GGC ACC TGG CAG GGC ATG AAG CTG TTT GGC GGG GAG CAG CGC TTC 14 88 

Gin Gly Thr Trp Gin Gly Met Lys Leu Phe Gly Gly Olu Gin Arg Phe 

485 490 495 

CTC CTG GCG GAG TCC GGC CAC ATC GCC GGC ATC ATC AAC CCG CCG GCC 1536 

Leu Leu Ala Glu Ser Gly His lie Ala Gly He lie Asn Pro Pro Ala 

500 505 510 

20 GCC AAC AAG TAC GGC TTC TGG CAC AAC GGG GCC GAG GCC GAG AGC CCG 1584 

Ala Asn Lye Tyr Gly Phe Trp His Asn Gly Ala Glu Ala Glu Ser Pro 

515 520 525 

GAG AGC TGG CTG GCA GGG GCG ACG CAC CAG GGC GGC TCC TGG TGG CCC 163 2 

Glu Ser Trp Leu Ala Gly Ala Thr His Gin Gly Gly Ser Trp Trp Pro 
530 535 540 

25 GAG ATG ATG GGC TTT ATC CAG AAC CGT GAC GAA GGG TCA GAG CCC GTC 1680 

Glu Met Met Gly Phe He Gin Asn Arg Asp Glu Gly Ser Glu Pro Val 
545 550 555 560 

CCC GCG CGG GTC CCG GAG GAA GGG CTG GCC CCC GCC CCC GGC CAC TAT 1728 

Pro Ala Arg Val Pro Glu Glu Gly Leu Ala Pro Ala Pro Gly His Tyr 

565 570 575 

GTC AAG GTG CGG CTC AAC CCC GTG TTT GCC TGC CCA ACA GAG GAG GAC 17 76 

Val Lya Val Arg Leu Asn Pro Val Phe Ala CyB Pro Thr Glu Glu Asp 

580 585 590 
GCC GCA TGA 178 5 

Ala Ala 

35 (2) INFORMATION FOR SEQ ID NOi 2: 

(i) SEQUENCE CHARACTERISTICS r 

(A) LENGTH: 594 amino acids 

( B ) TYPE: amino acid 

(C) STRANDEDNESS : 

40 ID) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 2i 



30 



45 



Met Ser Gin Pro Ser Tyr Gly Pro Lou Phe Glu Ala Leu Ala Hie Tyr 
1 5 10 15 



Asn Asp Lya Leu Leu Ala Met Ala Lys Ala Gin Thr Glu Arg Thr Ala 

20 25 30 

Gin Ala Leu Leu Gin Thr Asn Leu Asp Asp Leu Gly Gin Val Leu Glu 
35 40 45 

60 Gin Gly Ser Gin Gin Pro Trp Gin Leu lie Gin Ala Gin Met Asn Trp 

50 55 60 

Trp Gin Asp Gin Leu Lys Leu Met Gin His Thr Leu Leu Lys Ser Ala 
65 70 75 80 



55 



15 



EP 0 824 148 A2 



Gly Gla Pro Ser Glu Pro val He Thr Pro Glu Arg Ser Asp Arg : Arg 

85 90 95 

Phe Lys Ala Glu Ala Trp Ser Glu Gin Pro lie Tyr Asp Tyr Leu Lys 
. 100 105 HO 

Gin Ser Tyr Leu Leu Thr Ala Arg His Leu Leu Ala Ser Val Asp Ala 

115 120 125 

Leu Glu Gly Val Pro Gin Lys Ser Arg Glu Arg Leu Arg Phe Phe Thr 

130 135 140 

Arg Gin Tyr Val Asn Ala Met Ala Pro Ser Asn Phe Leu Ala Thr Asn 
145 150 155 160 

Pro Glu Leu Leu Lys Leu Thr Leu Glu Ser Asp Gly Gin Aan Leu Val 

165 l" 70 I 75 

Ara Gly Leu Ala Leu Leu Ala Glu Asp Leu Glu Arg Ser Ala Asp Gin 

180 165 190 

Leu Asn Tie Arg Leu Thr Asp Glu Ser Ala Phe Glu Leu Gly Arg Asp 

195 200 205 

Leu Ala Leu Thr Pro Gly Arg val val Gin Arg Thr Glu Leu Tyr Glu 

210 215 220 

Leu He Gin Tyr Ser Pro Thr Thr Glu Thr val Gly Lys Thr Pro Val 
225 230 235 240 

Leu He val Pro Pro Phe He Asn Lys Tyr Tyr He Met Asp Met Arg 

245 250 255 

Pro Gin Asn Ser Leu val Ala Trp Leu Val Ala Gin Gly Gin Thr Val 

260 265 270 

Phe Met He Ser Trp Arg Asn Pro Gly Val Ala Gin Ala Gin He Asp 

275 280 285 

Leu asp Asp Tyr Val val Asp Gly Val He Ala Ala Leu Asp Gly Val 

290 295 300 

Glu Ala Ala Thr Gly Glu Arg Glu Val His Gly ile^Gly Tyr Cyfi He 
305 310 315 320 

Gly Gly Thr Ala Leu 3er Leu Ala Met Gly Trp Leu Ala Ala Arg Arg 

325 330 335 

Gin Lys Gin Arg val Arg Thr Ala Thr Leu Phe Thr Thr Leu Leu Asp 

340 345 350 

Phe Ser Gin Pro Gly Glu Leu Gly He Phe He His Glu Pro He He 

355 360 365 

Ala Ala Leu Glu Ala Gin Asn Glu Ala Lys Gly He Met Asp Gly Arg 

370 375 380 

Gin Leu Ala val Ser Phe Ser Leu Leu Arg Glu Asn Ser Leu Tyr Trp 
385 390 395 400 

Asn Tyr Tyr He Asp Ser Tyr Leu Lys Gly Gin Ser Pro Val Ala Phe 

405 410 415 

Asp Leu Leu His Trp Asn Ser Asp Ser Thr Asn Val Ala Gly LyB Thr 

420 425 430 

His Asn Ser Leu Leu Arg Arg Leu Tyr Leu Glu Asn Gin Leu Val Lys 

435 440 445 

Gly Glu Leu Lys He Arg Asn Thr Arg He Asp Leu Gly Lys Val Lys 

450 455 460 

Thr Pro Val Leu Leu Val Ser Ala Val Asp Asp His He Ala Leu Trp 
465 470 475 480 

Gin Gly Thr Trp Gin Gly Met LyB Leu Phe Gly Gly Glu Gin Arg Phe 

485 490 495 

Leu Leu Ala Glu Ser Gly His He Ala Gly He He Asn Pro Pro Ala 

500 505 510 

Ala Asn Lys Tyr Gly Phe Trp His Asn Gly Ala Glu Ala Glu Ser Pro 

515 520 525 

Glu Ser Trp Leu Ala Gly Ala Thr His Gin Gly Gly Ser Trp Trp Pro 

530 535 540 

Glu Met Met Gly Phe He Gin Aan Arg Asp Glu Gly Ser Glu Pro Val 
545 550 555 560 

pro Ala Arg Val Pro Glu Glu Gly Leu Ala Pro Ala Pro Gly His Tyr 

565 570 575 

val Lys val Arg Leu Asn Pro Val Phe Ala Cys Pro Thr Glu Glu Asp 
580 585 590 

Ala Ala 



10 



15 



20 



25 



30 



35 



45 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/R5Y: CDS 

(B) LOCATIONrl . .351 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



ATG 


ATG 


AAT 


ATG 


GAC 


GTG 


ATC 


AAG 


AGC 


TTT 


ACC 


GAG 


CAG 


ATG 


CAA 


GGC 


U»4S 


Met 


Mat 


Asn 


Met 


Asp 


val 


He 


Lye 


Ser 


Phe 


Thr 


Glu 


Gin 


Met 


Gin 


Gly 




1 








5 










10 










15 






TTC 


GCC 


GCC 


CCC 


CTC 


ACC 


CGC 


TAC 


AAC 


CAG 


CTG 


CTG 


GCC 


AGC 


AAC 


ATC 


U»9 6 


Phe 


Ala 


Ala 


Pro 
20 


Leu 


Thr 


Arg 


Tyr 


Asn 
25 


Gin 


Leu 


Leu 


Ala 


Ser 
30 


Asn 


He 




GAA 


CAG 


CTG 


ACC 


CGG 


TTG 


CAG 


CTG 


GCC 


TCC 


GCC 


AAC 


GCC 


TAC 


GCC 


GAA 


144 


Clu 


Gin 


Leu 
35 


Thr 


Arc 


Leu 


Gin 


Leu 
40 


Ala 


Ser 


Ala 


Asn 


Ala 
45 


Tyr 


Ala 


Glu 




CTG 


GGC 


CTC 


AAC 


CAG 


TTG 


CAG 


GCC 


GTG 


AGC 


AAG 


GTG 


CAG 


GAC 


ACC 


CAG 


192 


Leu 


Gly 
50 


Leu 


Asn 


Gin 


Leu 


Gin 
55 


Ala 


Val 


Ser 


Lys 


Val 
60 


Gin 


Asp 


Thr 


Gin 




AGC 


CTG 


GCG 


GCC 


CTG 


GGC 


ACA 


GTG 


CAA 


CTG 


GAG 


ACC 


GCC 


AGC 


CAG 


CTC 


240 


Ser 


Leu 


Ala 


Ala 


Leu 


Gly 


Thr 


Val 


Gin 


Leu 


Glu 


Thr 


Ala 


Ser 


Gin 


Leu 




65 










70 










75 










80 




TCC 


CGC 


CAG 


ATG 


CTG 


GAT 


GAC 


ATC 


CAG 


AAG 


CTG 


AGC 


GCC 


CTC 


GGC 


CAG 


288 


Ser 


Arg 


Gin 


Met 


Leu 
85 


ASP 


ASp 


He 


Gin 


Lys 
90 


Leu 


Ser 


Ala 


Leu 


Gly 
95 


Gin 




CAG 


TTC 


AAG 


GAA 


GAG 


CTG 


GAT 


GTC 


CTG 


ACC 


GCA 


GAC 


GGC 


ATC 


AAG 


AAA 


336 


Gin 


Phe 


Lys 


Glu 
100 


Glu 


Leu 


Asp 


Val 


Leu 
105 


Thr 


Ala 


Asp 


Gly 


He 
110 


Lys 


Lys 




AGC 


ACG 


GGC 


AAG 


GCC 


TGA 






















354 


Ser 


Thr 


Gly 
115 


Lyg 


Ala 



























(2) INFORMATION FOR SEQ ID NO : 4i 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 
(CJ STRANDEDNESS: 

<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4; 



Met Met Asn Met Asp val He Lys Ser Phe Thr Glu Gin Met Gin Gly 

1 5 10 15 

Phe Ala Ala Pro Leu Thr Arg Tyr Asn Gin Leu Leu Ala Ser A3n He 

20 25 30 

Glu Gin Leu Thr Arg Leu Gin Leu Ala Ser Ala Asn Ala Tyr Ala Glu 

35 40 45 

60 Leu Gly Leu Asn Gin Leu Gin Ala Val Ser Lys Val Gin Asp Thr Gin 

50 55 60 

Ser Leu Ala Ala Leu Gly Thr Val Gin Leu Glu Thr Ala Ser Gin Leu 



55 



17 



to 



15 



20 



25 



30 



35 



55 
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65 70 75 30 

Ser Arg Gin Met Leu Asp Asp He Gin Lys Leu Ser Ala Leu Gly Gin 

85 90 95 

bin Phe Lys Glu Glu Leu Asp Val Leu Thr Ala Asp Gly. lie Lys Lys? 

100 105 HO 

Ser Thr Gly Lys Ala 
115 

2} INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS » double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



Cix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION:!. .402 





(xi) 


SEQUENCE DESCRIPTION! SEQ ID NC 


>: 5: 












ATG 


AGC 


GCA 


CAA 


TCC 


CTG 


GAA 


GTA 


GGC 


CAG 


AAG 


GCC 


CGT 


CTC 


AGC 


AAG 


Met 


Ser 


Ala 


Gin 


ser 


Leu 


Glu 


Val 


Gly 


Gin 


Lys 


Ala 


Arg 


Leu 


Ser 


Lys 


1 






5 










10 










15 




CGG 


TTC 


GGG 


GCG 


GCG 


GAG 


GTA 


GCC 


GCC 


TTC 


GCC 


GCG 


CTC 


TCG 


GAG 


GAC 


Arg 


Phe 


Gly 


Ala 


Ala 


Glu 


Val 


Ala 


Ala 


Phe 


Ala 


Ala 


Leu 


Ser 


Glu 


Abp 




20 










25 










30 




TTC 


TTC 


AAC 


CCC 


CTG 


CAC 


CTG 


GAC 


CCG 


GCC 


TTC 


GCC 


GCC 


ACC 


ACQ 


GCG 


Phe 


Asn 


Pro 


Leu 


His 


Leu 


ASp 


Pro 


Ala 


Phe 


Ala 


Ala 


Thr 


Thr 


Ala 


Phe 






35 










40 










45 








GAG 


CGG 


CCC 


ATA 


GTC 


CAC 


GGC 


ATG 


CTG 


CTC 


GCC 


AGC 


CTC 


TTC 


TCC 


GGG 


Glu 


Arg 


Pro 


He 


val 


HiS 


Gly 


Met 


Leu 


Leu 


Ala 


Ser 


Leu 


Phe 


Ser 


Gly 




50 










55 










60 










CTG 


CTG 


GGC 


CAG 


CAG 


TTG 


CCG 


GGC 


AAG 


GGG 


AGC 


ATC 


TAT 


CTG 


GGT 


CAA 


Leu 


Leu 


Gly 


Gin 


Gin 


Leu 


Pro 


Gly 


Lys 


Gly 


Ser 


He 


Tyr 


Leu 


Gly Gin 


65 






70 










75 










80 


AGC 


CTC 


AGC 


TTC 


AAG 


CTG 


CCG 


GTC 


TTT 


GTC 


GGG 


GAC 


GAG 


GTG 


ACG 


GCC 


Ser 


Leu 


ser 


phe 


Lys 


Leu 


Pro 


Val 


Phe 


val 


Gly 


ASp 


Glu 


Val 


Thr 


Ala 








85 










90 










95 




GAG 


GTG 


GAG 


GTG 


ACC 


GCC 


CTT 


CGC 


GAG 


GAC 


AAG 


CCC 


ATC 


GCC 


ACC 


CTG 


Glu 


Val 


Glu 


Val 


Thr 


Ala 


Leu 


Arg 


Glu 


ASP 


Lys 


Pro 


He 


Ala 


Thr 


Leu 








100 










105 










110 






ACC 


ACC 


CGC 


ATC 


TTC 


ACC 


CAA 


GGC 


GGC 


GCC 


CTC 


GCC 


GTG 


ACG 


GGG 


GAA 


Thr 


Thr 


Arg 


He 


Phe 


Thr 


Gin 


Gly 


Gly 


Ala 


Leu 


Ala 


Val 


Thr 


Gly Glu 






115 










120 










125 








GCC 


GTG 


GTC 


AAG 


CTG 


CCT 


TAA 




















Ala 


Val 


Val 


Lys 


Leu 


Pro 
























130 






























(2) 


INFORMATION 


FOR 


SEQ 


ID 


NO: 


6 : 

















48 
96 
144 
192 
240 
288 
336 
384 
405 



45 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

50 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



18 



10 



15 



20 



30 
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(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 6: 



Met 


Ser 


Ala 


Gin 


Ser 


Leu 


Glu 


Val 


Gly 


Gin 


Lys 


Ala 


Arg 


Leu 


Ser 


Lys 


l 


















10 










15 




Arg 


Phe 


Gly 


Ala 


Ala 


Glu 


val 


Ala 


Ala 


Phe 


Ala 


Ala 


Leu 


Ser 


Glu 


ASP 




20 










25 








Th'r 


30 






Phe 


Asn 


Pro 


Leu 


His 


Leu 


Asp 


Pro 


Ala 


Phe 


Ala Ala 


Thr 


Ala 


Phe 






35 










40 










45 








Glu 


Arg 
50 


Pro 


He 


Val 


His 


Gly 
55 


Met 


Leu 


Leu 


Ala 


Ser 
60 


Leu 


Phe 


Ser 


Gly 


Leu 


Leu 


Gly 


Gin 


Gin 


Leu 


Pro 


Gly 


Lys 


Gly 


Ser 


He 


Tyr 


Leu 


Oly 


Gin 


65 










70 










75 










80 


Ser 


Leu 


Ser 


Phe 


Lys 
85 


Leu 


Pro 


val 


Phe 


val 
90 


Gly 


Asp 


Glu 


val 


Thr 
95 


Ala 


Glu 


val 


Glu 


Val 
100 


Thr 


Ala 


Leu 


Arg 


Glu 
105 


ASP 


Lys 


Pro 


He 


Ala 
110 


Thr 


Leu 


Thr 


Thr 


Arg 
115 


Zle 


Phe 


Thr 


Gin 


Oly 
120 


Gly 


Ala 


Leu 


Ala 


Val 
125 


Thr 


Gly 


Glu 


Ala 


val 
130 


Val 


Lys 


Leu 


Pro 























(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 27 base pairs 
( D ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CCSCCSTGGA TCAAYAAGTW YTAYATC 2 7 

(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 27 base pairs 
35 (B) TYPE: nucleic acid 

(C> STRANDEDNESS i single 
(D) TOPOLOGY: linear 

{11} MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA* 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

SAGCCASGCS GTCCARTCSG GCCACCA 27 

(2) INFORMATION FOR SEQ ID NO: 9i 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

60 (ii) MOLECULE TYPE: DNA (genomic) 

tlx) FEATURE : 



55 



19 
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TO 



15 



20 



30 



45 



(A) NAME /KEY : CDS 

(B) LOCATION: 384.. 734. 



(ix) FEATURE : 

(A) NAME/KEY i CDS 

(B) LOCATION: 830. 



.2611 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9: 

AGATCTGCAC CGGGGTGCTG GCCTGGGCCA CGCCGGCGAG GGCCAGCGCG GAGCAACCGA 
GCAGCAGGGC GAGAGGTTTC ATCGGGATTC CTTGGCAGTC TGAATG A CGT GCCAGCCTAT 
CAGCGCGGCG CCGGTGCGGC GAGGGCGCGC CGGACCCAGT GCGTCACCTC TCGTCTGATC 
CGCCTCCCTC GACGGGCGTC GCTGACAAAA AAATTCAAAC AGAAATTAAC ATTTATGTCA 
TTTACACCAA ACCGCATTTG GTTGCAGAAT GCT CAAACGT GTGTTTGAAC AGAGCAAGCA 
ACACGTAAAC AGGGATGACA TGCAGTACCC GTAAGAAGGG CCGATTGGCC CACAACAACA 
CTGTTCTGCC GAACTGGAGA CCG ATG ATG AAT ATG GAC GTG ATC AAG AGC 

Met Met Asm Mat Asp Val He Lys Ser 
1 5 



10 



15 



30 



45 



50 



60 



65 



75 



80 



90 



95 



Thr Ala Asp Gly 



He 
110 



10 



15 



30 



45 



50 



60 

CAC ACC CTG CTC AAA AGC GCA GGC CAG CCG AGC GAG CCG GTG ATC ACC 
His Thr Leu Leu Lys Ser Ala Gly Gin Pro Ser Glu Pro Val He Thr 



65 



70 



85 



CCG. GAG CGC AGC GAT CGC CGC TTC AAG GCC GAG GCC TGG AGC GAA CAA 
Pro Glu Arg Ser Asp Arg Arg Phe Lys Ala Glu Ala Trp Ser Glu Gin 

100 1 n q 



105 



CCC ATC TAT GAC TAC CTC AAG CAG TCC TAC CTG CTC ACC GCC AGG CAC 

Pro He Tyr Asp Tyr Leu Lys Gin Ser Tyr Leu Leu Thr Ala Arg His 

110 "5 x2 ° 

CTG CTG GCC TCG GTG GAT GCC CTG GAG GGC GTC CCC CAG AAG AGC CGC 



60 
120 
180 
240 
300 
360 
410 



GCC 


CCC 


CTC 


ACC CGC 


TAC 


AAC 


458 


Ala 


Pro 


Leu 


Thr Arg 


Tyr 


Asn 






20 








25 




CTG 


ACC 


COG 


TTG CAG 


CTG 


GCC 


DUO 


Leu 


Thr 


Arg 


Leu Gin 


Leu 


Ala 




35 








40 




554 


CTC 


AAC 


CAG 


TTG CAG 


GCC 


GTG 


Leu 


Asn 


Gin 


Leu Gin 


Ala 


val 










55 








GOG 


GCC 


CTG 


GGC ACA 


GTG 


CAA 


602 


Ala 


Ala 


Leu 


Gly Thr 


Val 


Gin 










70 








CAG 


ATG 


CTG 


GAT GAC 


ATC 


CAG 


650 


Gin 


Met 


Leu 


ASP Asp 


He 


Gin 








85 










AAG 


GAA 


GAG 


CTG GAT 


GTC 


CTG 


698 


LyB 


Glu 


Glu 


Leu Asp 


Val 


Leu 




100 








105 




GGC 


AAG 


GCC 


TGATAACCCC 




744 


Gly 


Lys 


Ala 










115 














TGACTCGA 


CGCTACGGGC TAGTTCCCGC 


804 


CAA 


CCA 


TCT 


TAT GGC 


CCG 


CTG' 


856 


Gin 


Pro 


Ser 


Tyr Gly 


Pro 


Leu 




AAG 


CTG 


5 

CTG 


GCC ATG 


GCC 


AAG 


904 


Lys 


Leu 


Leu 


Ala Met 


Ala 


Lye 




20 








25 




CTG 


CTG 


CAG 


ACC AAT 


CTG 


GAC 


952 


Leu 


Leu 


Gin 


Thr Asn 


Leu 


Asp 




35 








40 






AGC 


CAG 


CAA 


CCC TGG 


CAG 


CTG 


1000 


Ser 


Gin 


Gin 


Pro Trp 


Gin 


Leu 










55 








; GAT 


CAG 


CTC 


AAG CTG 


ATG 


CAG 


1048 


Asp 


Gin 


Leu 


Lys Leu 


Met 


Gin 





1096 
1144 
1192 
1240 



55 



20 




EPO 824 143 A2 



15 



20 



30 



35 



50 



L.eu 


Leu 


Ala 


Ser 


Val 


ASP 


Ala 


Leu 


Glu 


Gly 


Val 


Pro 


Gin 


Ly Bi- 


Ser 


Arg 








125 










130 










ll*' 






1288 


GAG 


CGG 


CTG 


CGT 


TTC 


TTC 


ACC 


CGC 


CAG 


TAC 


GTC 


AAC 


GCC 


ATG 


GCC 


CCC 


Glu 


Arg 


Leu 


Arg 


Phe 


Phe 


Thr 


Arg 


Gin 


Tyr 


Val 


Asn 


Ala 


Met 


Ala 


Pro 






140 










145 










150 










AGC 


AAC 


TTC 


CTG 


GCC 


ACC 


AAC 


CCC 


GAG 


CTG 


CTC 


AAG 


CTG 


ACC CTG 


GAG 


1336 


Ser 


Asn 


Phe 


Leu 


Ala 


Thr 


Asn 


Pro 


Glu 


Leu 


Leu 


Lys 


•Leu 


Thr 


Leu 


Glu 






155 










160 










165 










1384 


TCC 


GAC 


CGC 


CAG 


AAC 


CTG 


GTG 


CGC 


GGA 


CTG 


GCC 


CTC 


TTC 


GCC 


GAG 


GAT 


Ser 


Asp 


Gly 


Gin 


Asn 


Leu 


val 


Arg 


Gly 


Leu 


Ala 


Leu 


Leu 


Ala 


Glu 


Asp 




170 






175 










180 










185 




CTG 


GAG 


CGC 


AGC 


GCC 


GAT 


CAG 


CTC 


AAC 


ATC 


CGC 


CTG 


ACC 


GAC 


GAA 


TCC 


1432 


Leu 


Glu 


Arg 


Ser 


Ala 


Asp 


Gin 


Leu 


Asn 


He 


Arg 


Leu 


Thr Asp 


Glu 


Ser 










190 










195 










200 






GCC 


TTC 


GAG 


CTC 


GGG 


CGG 


GAT 


CTG 


GCC 


CTG 


ACC 


CCG 


GGC 


CGG 


GTG 


GTG 


1480 


Ala 


Phe 


Glu 


Leu 


Gly 


Arg 


Asp 


Leu 


Ala 


Leu 


Thr 


Pro 


Gly Arg 


val 


val 










205 










210 










215 








CAG 


CGC 


ACC 


GAG 


CTC 


TAT 


GAG 


CTC 


ATT 


CAG 


TAC 


AGC 


CCG 


ACT 


ACC 


GAG 


1528 


Gin 


Arg 


Thr 
220 


Glu 


Leu 


Tyr 


Glu 


Leu 
225 


He 


Gin 


Tyr 


Ser 


Pro 
230 


Thr 


Thr 


Glu 




ACG 


GTG 


GGC 


AAG 


ACA 


CCT 


GTG 


CTG 


ATA 


GTG 


CCG 


CCC 


TTC 


ATC 


AAC 


AAG 


1576 


Thr 


Val 
235 


Gly 


LyB 


Thr 


Pro 


val 
240 


Leu 


He 


Val 


Pro 


Pro 
245 


Phe 


He 


Asn 


Lys 




TAC 


TAC 


ATC 


ATG 


GAC 


ATG 


CGG 


CCC 


CAG 


AAC 


TCC 


CTG 


GTC 


GCC TGG 


CTG 


1624 


Tyr 


Tyr 


He 


Met 


Asp 


Met 


Arg 


Pro 


Gin 


Asn 


Ser 


Leu 


val 


Ala 


Trp 


Leu 




250 








255 










260 










265 




GTC 


GCC 


CAG 


GGC 


CAG 


ACG 


GTA 


TTC 


ATG 


ATC 


TCC 


TGG 


CGC 


AAC 


CCG 


GGC 


1672 


Val 


Ala 


Gin 


Gly 


Gin 


Thr 


val 


Phe 


Met 


He Ser 


Trp 


Arg 


Asn 


Pro 


Gly 












270 










275 










280 






GTG 


GCC 


CAG 


GCC 


CAA 


ATC 


GAT 


CTC 


GAC 


GAC 


TAC 


GTG 


GTG 


GAT 


GGC 


GTC 


1720 


val 


Ala 


Gin 


Ala 


Gin 


He 


Asp 


Leu 


Asp 


Asp 


Tyr 


Val 


Val 


ASp 


Gly val 










285 










290 










295 








ATC 


GCC 


GCC 


CTG 


GAC 


GGC 


GTG 


GAG 


GCG 


GCC 


ACC 


GGC 


GAG 


CGG 


GAG 


GTG 


1768 


He 


Ala 


Ala 
300 


Leu 


Asp 


Gly 


val 


Glu 
305 


Ala 


Ala 


Thr 


Gly 


Glu 
310 


Arg 


Glu 


val 




CAC 


GGC 


ATC 


GGC 


TAC 


TGC 


ATC 


GGC 


GGC 


ACC 


GCC 


CTG 


TCG 


CTC 


GCC 


ATG 


1816 


His 


Gly 
315 


He 


Gly 


Tyr 


Cys 


He 
320 


Gly 


Gly 


Thr 


Ala 


Leu 
325 


ser 


Leu 


Ala 


Met 




GGC 


TGG 


CTG" 


GCG 


GCG 


CGG 


CGC 


CAG 


AAG 


CAG 


CGG 


GTG 


CGC 


ACC 


GCC 


ACC 


1864 


Gly 


Trp 


Leu 


Ala 


Ala 


Arg 


Arg 


Gin 


Lys 


Gin 


Arg 


Val 


Arg 


Thr 


Ala 


Thr 




330 










335 










340 










345 




CTG 


TTC 


ACT 


ACC 


CTG 


CTG 


GAC 


TTC 


TCC 


CAG 


CCC 


GGG 


GAG 


CTT 


GGC 


ATC 


1912 


Leu 


Phe 


Thr 


Thr 


Leu 
350 


Leu 


ABp 


Phe 


Ser 


Gin 
355 


Pro 


Gly 


Glu 


Leu 


Gly 
360 


He 




TTC 


ATC 


CAC 


GAG 


CCC 


ATC 


ATA 


GCG 


GCG 


CTC 


GAG 


GCG 


CAA 


AAT 


GAG 


GCC 


1960 


Phe 


He 


Ftis 


Glu 
365 


Pro 


He 


He 


Ala 


Ala 
370 


Leu 


Glu 


Ala 


Gin 


Asn 

375 


Glu 


Ala 




AAG 


GGC 


ATC 


ATG 


GAC 


GGG 


CGC 


CAG 


CTG 


GCG 


GTC 


TCC 


TTC 


AGC 


CTG 


CTG 


2008 


Lys 


Gly 


He 
380 


Met 


Asp 


Gly 


Arg 


Gin 
385 


Leu 


Ala 


Val 


Ser 


Phe 
390 


Ser 


Leu 


Leu 




COG 


GAG 


AAC 


AGC 


CTC 


TAC 


TGG 


AAC 


TAC 


TAC 


ATC 


GAC 


AGC 


TAC 


CTC 


AAG . 


2056 


Arg 


Glu 
395 


Afln 


Ser 


Leu 


Tyr 


Trp 
400 


Asn 


Tyr 


Tyr 


He 


Asp 
405 


Ser 


Tyr 


Leu 


Lys 




GGT 


CAG 


AGC 


CCG 


GTG 


GCC 


TTC 


GAT 


CTG 


CTG 


CAC 


TGG 


AAC 


AGC 


GAC 


AGC 


2104 


Gly 


Gin 


Ser 


Pro 


val 


Ala 


Phe 


Asp 


Leu 


Leu 


His 


Trp 


Asn 


Ser 


Asp 


Ser 




410 










415 










420 










425 




ACC 


AAT 


GTG 


GCG 


GGC 


AAG 


ACC 


CAC 


AAC 


AGC 


CTG 


CTG 


CGC 


CGT 


CTC 


TAC 


2152 


Thr 


Afin 


Val 


Ala 


Gly 
430 


Lys 


Thr 


His 


Asn 


Ser 
435 


Leu 


Leu 


Arg 


Arg 


Leu 

440 


Tyr 




CTG 


GAG 


AAC 


CAG 


CTG 


GTG 


AAG 


GGG 


GAG 


CTC 


AAG 


ATC 


CGC 


AAC 


ACC 


CGC 


2200 


Leu 


Glu 


Afin 


Gin 
445 


Leu 


val 


Lys 


Gly 


Glu 
450 


Leu 


Lys 


He 


Arg 


Asn 
455 


Thr 


Arg 




ATC 


GAT 


CTC 


GGC 


AAG 


GTG 


AAG 


ACC 


CCT 


GTG 


CTG 


CTG 


GTG 


TCG 


GCG 


GTG 


2248 


He 


Asp 


Leu 
460 


Gly 


Lys 


Val 


Lys 


Thr 
465 


Pro 


val 


Leu 


Leu 


Val 
470 


Ser 


Ala 


val 
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GAC GAT CAC ATC GCC CTC TGG GAG GGC ACC TGG CAG GGC ATG AAG CTG 229 6 
Asp' Asp His lie Ala Leu Trp Gin Gly Thr Trp Gin Gly Met Lys Leu 

475 480 4fl 5 

T*T GGC GGG GAG CAG CGC TTC CTC CTG GCG GAG TCC GGC CAC ATC GCC * 2344 
Phe Gly Gly Glu Gin Arg Phe Leu Leu Ala Glu Ser Gly His lie Ala 
490 495 500 505 

GGC ATC ATC AAC CCG CCO GCC GCC AAC AAG TAC GGC TTC' TGG CAC AAC 2392 
Glv lie He Asn Pro Pro Ala Ala Asn Lys Tyr Gly Phe Trp His Asn 

510 515 520 

GGG GCC GAG GCC GAG AGC CCG GAG AGC TGG CTG GCA GGG GCG ACC CAC 2440 
Glv Ala Glu Ala Glu Ser Pro Glu ser Trp Leu Ala Gly Ala Thr His 

525 530 535 

CAG GGC GGC TCC TGG TGG CCC GAG ATG ATG GGC TTT ATC CAG AAC CGT 2488 
Gin Gly Gly Ser Trp Trp Pro Glu Met Met Gly Phe He Gin Asn Arg 

540 545 550 

GAC GAA GGG TCA GAG CCC GTC CCC GCG CGG GTC CCG GAG GAA GGG CTG 2536 
Asp Glu Gly Ser Glu Pro Val Pro Ala Arg val Pro Glu Glu Gly Leu 

555 560 565 

GCC CCC GCC CCC GGC CAC TAT GTC AAG GTG CGG CTC AAC CCC GTG TTT 2 5 84 
Ala pro Ala Pro Gly His Tyr val LyB Val Arg Leu Asn Pro Val Phe 
570 575 580 585 

GCC TGC CCA ACA GAG GAG GAC GCC GCA TGAGCGCACA ATCCCTGGAA 2631 
Ala Cys Pro Thr Glu Glu Asp Ala Ala 
590 

GTAGGCCAGA AGGCCCGTCT CAGCAAGCGG TTCGGGGCGG CGGAGGTAGC CGCCTTCGCC 2691 
GCGCTCTCGG AGGACTTCAA CCCCCTGCAC CTGGACCCGG CCTTCGCCGC CACCACGGCG 2 751 
TTCGAGCGGC CCATAGTCCA CGGCATGCTG CTCGCCAGCC TCTTCTCCGG GCTG CTGGGC 2811 
CAGCAGTTGC CGGGCAAGGG GAGCATCT AT CTGGGTCAAA GCCTCAGCTT CAAGCTGCCG 2871 
GTCTTTGTCG GGGACGAGGT GACGGCCGAG GTGGAGGTGA CCGCCCTTCG ,CGAGGACAAG 2931 
CCCATCGCCA CCCTGACCAC CCGCATCTTC ACCCAAGGCG GCGCCCTCGC CGTGACGGGG 2991 
GAAG CCGTGG TCAAGCTGCC TTAAGCACCG GCGGCACGCA GGCACAATCA GCCCGGCCCC 3051 
TGCCGGGCTG ATTG TTCTCC CCCGCTCCGC TTGCCCCCTT TTT CGGGGCA ATTTGGCCCA 3111 
GGCCCTTTCC CTGCCCCGCC TAACTGCCTA AAATGGCCGC CCT GCCGTGT AGGCATTCAT 3171 
CCAGCTAGAG GAATTC 



(3) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3187 base pairs 

(B) TYPE r nucleic acid 

(C) STRAND ED NESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE* 

(A) NAME/KEY: CDS 

(BJ LOCATION: 2611.. 3 012 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

AGATCTGGAC CGGGGTOCTG GCCTGGGCCA CGCCGGCGAG GGCCAGCGCG GAG CAACCGA 60 

GCAGCAGGGC GAGAGGTTTC ATCGGGATTC CTTGGCAGTC TGAATGACGT GCCAGCCTAT 120 

CAGCGCGGCG CCGGTGCGGC GAGGGCGCGC CGGACCCAGT GCGTCACCTC TCGTCTGATC 180 

CGCCTCCCTC GACGGGCGTC GCTGACAAAA AAATTCAAAC AGAAATTAAC ATTTATGTCA 240 

TTTACACCAA ACCGCATTTG GTTGCAGAAT GCTCAAACGT GTGTTTGAAC AGAGCAAGCA 3 00 

ACA CGT AAA C AGGGATGACA TGCAGTACCC GTAAGAAGGG CCGATTGGCC CACAACAACA 3 60 

CTGTTCTGCC GAACTGGAGA CCGATGATGA ATATGGACGT GATCAAGAGC TTTACCGAGC 4 20 

AGATGCAAGG CTTCGCCGCC CCCCTCACCC GCTACAACCA GCTGCTGGCC AGCAACATCG 480 

AA CAGCTG A C CCGGTTGCAG CTGGCCTCCG CCAACGCCTA CGCCGAACTG GGCCTCAACC 540 

AGTTGCAGGC CGTGAGCAAG GTGCAGGACA CCCAGAGCCT GGCGGCCCTG GGCACAGTGC 600 

AACTGGAG A C CGCCAGCCAG CTCTCCCGCC AGATGCTGGA TGACATCCAG AAGCTGAGCG 660 
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10 



CCCTCGGCCA GCAGTTCAAG GAAGAGCTGG ATGTCCTGAC CGCAGACGGC ATCAAGAAAA 720 
' GCACGGGCAA GGCCTGATAA CCCCTGGCTG CCCGTTCGGG CAG CCA CATC TCCCCATGAC 780 
- TCGACGCTAC GGGCTAGTTC CCGCCTCGGG TGTGGGTGAA GGAGAGCACA TGAGCCAACC 840 
ATCTTATGGC CCGCTGTTCG AGGCCCTGGC CCACTACAAT GACAAGCTGC TGGCCATQBC 900 
CAAGGCCCAG ACAGAGCGCA CCGCCCAGGC GCTGCTGCAG ACCAATCTGG ACGATCTGGG 960 
CCAGGTQCTG GAGCAGGGCA GCCAGCAACC CTGGCAGCTG ATCCAGGCCC AGATGAACTG 1020 
GTGGCAGGAT CAGCTCAAGC TGATGCAGCA CACCCTGCTC AAAAQC'GCAG GCCAGCCGAG 1080 
CGAGCCGGTG ATCACCCCGG AGCGCAGCGA TCGCCGCTTC AAGGCCGAGG CCTGGAGCGA 1140 
ACAACCCATC TATGACTACC TCAAGCAGTC CTACCTGCTC ACCGCCAGGC ACCTGCTGGC 1200 
CTCGGTGGAT GCCCTGGAGG GCGTCCCCCA GAAGAGCCGG GAGCGGCTGC GTTTCTTCAC 1260 
CCGCCAGTAC GTCAACGCCA TGGCCCCCAG CAACTTCCTG GCCACCAACC CCGAGCTGCT 1320 
CAAGCTGACC CTGGAGT CCG ACGGCCAGAA CCTGGTGCGC GGACTGGCCC TCTTGGCCGA 1380 
GGATCTGGAG CGCAGCGCCG AT CAGCTCAA CATCCGCCTG ACCGACGAAT CCGCCTTCGA 1440 
QCTCGGOCGO GATCTGGCCC TGACCCCGCG CCGGGTGGTG CAGCGCACCG AGCTCTATGA 1500 
GCTCATTCAG TACAGCCCGA CTACCGAGAC GGTGGGCAAG ACACCTGTGC TGATAGTGCC 1560 
GCCCTTCATC AACAAGTACT ACATCATGGA CATGCGGCCC CAGAACTCCC TGGTCGCCTG 162 0 
15 GCTGGTCGCC CAGGGCCAGA CGGTATTCAT GATCTCCTGG CGCAACCCGG GCGTGGCCCA 1680 

GGCCCAAATC GATCTCGACG ACTACGTGGT GGATGGCGTC ATCGCCGCCC TGGACGGCGT 1740 
GGAGGCGGCC ACCGGCGAGC GGGAGGTGCA CGGCAT CGGC TACTGCATCG GCGGCACCGC 1800 
CCTGTCGCTC GCCATGGGCT GGCTGGCGGC GCGGCGCCAG AAGCAGCGGG TGCGCACCGC 1860 
CACCCTGTTC ACTACCCTGC TGGACTTCTC CCAGCCCGGG GAGCTTGGCA TCTTCATCCA 1920 
CGAGCCCATC ATAGCGGCGC TCGAGGCGCA AAATGAGGCC AAGGGCATCA TGGACGGGCG 1980 
2Q CCAGCTGGCG GTCTCCTTCA GCCTGCTGCG GGAGAACAGC CTCTACTGGA ACTACTACAT 204 0 

CGACAGCTAC CTCAAGGGTC AGAGCCCGGT GGCCTT CGAT CTGCTGCACT GGAACAGCGA 2100 
CAGCACCAAT GTGGCGGGCA AGACCCACAA CAGC CTGCTG CGCCGTCTCT ACCTGGAGAA 2160 
CCAGCTGGTG AAGGGGGAGC TCAAGATCCG CAACACCCGC ATCGATCTCG GCAAGGTGAA 2220 
GACCCCTGTG CTGCTGGTGT CGGCGGTGGA CGAT CA CATC GCCCTCTGGC AGGGCACCTG 2280 
G CAGGC CATG AAGCTGTTTG GCGGGGAGCA GCGCTTCCTC CTGGCGGAGT CCGGCCACAT 2340 
CGCCGGCATC ATCAACCCGC CGGCCGCCAA CAAGT A CGGC TTCTGGCACA ACGGGG CCG A 2400 
25 GGCCGAGAGC CCGGAGAGCT GG CTGGCAGG GG CG A CGCAC CAGGGCGGCT CCTGGTGGCC 2460 

CGAGATGATG GGCTTTATCC AG AA CCGTGA CGAAGGGTCA GAGCCCGTCC CCGCGCGGGT 2520 
CCCGGAGGAA GGGCTGGCCC CCGCCCCCGG CCACTATGTC AAGGTGCGGC TCAACCCCGT 2580 
GTTTGCCTGC CCAACAGAGG AGGACGCCGC ATG AGC GCA CAA TCC CTG GAA GTA 2634 

Met Ser Ala Gin Ser Leu Glu Val 
1 5 

30 GGC CAG AAG GCC CGT CTC AGC AAG CGG TTC GGG GCQ GCG GAG GTA GCC 26B2 

Gly Gin Lys Ala Arg Leu Ser Lys Arg Phe Gly Ala Ala Glu Val Ala 

10 15 20 

GCC TTC GCC GCG CTC TCG GAG GAC TTC AAC CCC CTG CAC CTG GAC CCG 273 0 
Ala Phe Ala Ala Leu Ser Glu Asp Phe Asn Pro Leu Hie Leu Asp Pro 
25 30 35 40- 

35 GCC TTC GCC GCC ACC ACG GCG TTC GAG CGG CCC ATA GTC CAC GGC ATG 2778 

Ala Phe Ala Ala Thr Thr Ala Phe Glu Arg Pro lie Val His Gly Met 

45 50 55 

CTG CTC GCC AGC CTC TTC TCC GGG CTG CTG GGC CAG. CAG TTG CCG GGC 2826 
Leu Leu Ala Ser Leu Phe Ser Gly Leu Leu Gly Gin Gin Leu Pro Gly 
60 65 70 

x&r* nnn »nr 

40 



45 



AAG 


GGG 


AGC 


ATC 


TAT 


CTG 


GGT 


CAA 


AGC 


CTC 


AGC 


TTC 


AAG 


CTG 


CCG 


GTC 


2874 


Lys 


Gly 


Ser 
75 


lie 


Tyr 


Leu 


Gly 


Gin 
80 


Ser 


Leu 


ser 


Phe 


Lys 
85 


Leu 


Pro 


val 




TTT 


GTC 


GGG 


GAC 


GAG 


GTG 


ACG 


GCC 


GAG 


GTG 


GAG 


GTG 


ACC 


GCC 


CTT 


CGC 


2922 


Phe 


Val 
90 


Gly 


Asp 


Glu 


val 


Thr 
95 


Ala 


Glu 


val 


Glu 


Val 
100 


Thr 


Ala 


Leu 


Arg 




GAG 


GAC 


AAG 


CCC 


ATC 


GCC 


ACC 


CTG 


ACC 


ACC 


CGC 


ATC 


TTC 


ACC 


CAA 


GGC 


2970 


Glu 


ASp 


Lys 


Pro 


lie 


Ala 


Thr 


Leu 


Thr 


Thr 


Arg 


He 


Phe 


Thr 


Gin 


Gly 




105 










110 










115 










120 




GGC 


GCC 


CTC 


GCC 


GTG 


ACG 


GGG 


GAA 


GCC 


GTG 


GTC 


AAG 


CTG 


CCT 






3012 


Gly 


Ala 


Leu 


Ala 


val 
125 


Thr 


Gly 


Glu 


Ala 


val 
130 


Val 


Lys 


Leu 


Pro 









TAAGCACCGG CGGCACGCAG GCACAATCAG CCCGGCCCCT GCCGGGCTGA TTGTTCTCCC 3072 

CCGCTCCGCT TGCCCCCTTT TTCGGGGCAA TTTGCCCCAG GCCCTTTCCC TGCCCCGCCT 313 2 

AACTGCCTAA AATGGCCGCC CTGCCGTGTA GCCATTCATC CAGCTAGAGG AATTC 3187 

(2) INFORMATION POR SEQ ID NO i 11: 
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10 



15 



25 



30 



35 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH i 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDN.ES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE r other nucleic acid 

(A) DESCRIPTION: /desc ■ "synthetic DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AGTTCCCGCC TCGGGTGTOO GTGAA 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSt single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 
20 (a) DESCRIPTION: /desc = "synthetic DNA* 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGCATATGCG CTCATGCGGC GTCCT 25 



(2) INFORMATION FOR SEQ ID NO: 13 i 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : Bingle 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc ■ "synthetic dna- 

(xi) SEQUENCE DESCRIPTION i SEQ ID NO: 13: 

GCCATATGAG CGCACAATCC CTGGAAGTAG 

(2) INFORMATION POR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
46 (A) DESCRIPTION: /desc - "synthetic DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: . 

CTGGGATCCG CCGGTGCTTA AGGCAGCTTG 

60 

(2) INFORMATION FOR SEQ ID NO: 15 t 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
( C) STRANDEDNES S : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : . peptide . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Ser Ala Gin Ser Leu Glu Val Gly Gin LyB Ala Arg Leu Ser Lys Arg 

15 10 15 

Phe Gly Ala Ala 
20 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNES S : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Ser Ala Gin Ser Leu Glu Val Gly Gin Lys Ala Arg Leu Ser Lys 

15 10 15 

Arg Phe Gly Ala Ala 
20 



Claims 



1. A polyester synthase gene coding for a polypeptide containing the amino acid sequence of SEQ ID NO:2 or a 
sequence where in said amino acid sequence, one or more amino acids are deleted, replaced or added, said 

40 polypeptide bringing about polyester synthase activity. 

2. A polyester synthase gene comprising the nucleotide sequence of SEQ ID NO:1 . 

3. A gene expression cassette comprising the polyester synthase gene of claims 1 or 2 and either of open reading 
45 frames located upstream and downstream of said gene. 

4. The gene expression cassette according to claim 3, wherein the open reading frame located upstream of the poly- 
ester synthase gene comprises DNA coding for the amino acid sequence of SEQ ID NO: 4. 

so 5. The gene expression cassette according to claim 3, wherein the open reading frame located upstream of the poly- 
ester synthase gene comprises the nucleotide sequence of SEQ ID NO:3. 

6. The gene expression cassette according to claim 3. wherein the open reading frame located downstream of the 
polyester synthase gene comprises DNA coding for a polypeptide containing the amino acid sequence of SEQ ID 

55 NO:6 or a sequence where in said amino acid sequence, one or more amino acids are deleted, replaced or added, 
said polypeptide bringing about enoyl-CoA hydratase activity. 

7. The gene expression cassette according to daim 3. wherein the open reading frame located downstream of the 
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polyester synthase gene comprises the nucleotide sequence of SEQ ID NO:5. 

8. A recombinant vector comprising the polyester synthase gene of claim t or 2 or the gene expression cassette of 
any one of claims 3 to 7. 

9. A transformant transformed with the recombinant vector of claim 8. 

10. A process for producing polyester, wherein the transformant of claim 9 is cultured in a medium and polyester is 
recovered from the resulting culture. 

11. The process for producing polyester according to claim 10. wherein the polyester is a copolymer of 3-hydroxyalka- 
noic acid represented by formula I: 

R 

I <*> 
HO — CH - CH 2 — COOH 

wherein R represents a hydrogen atom or a C1 to C4 alkyt group. 

1 2. The process for producing polyester according to claim 1 0, wherein the polyester is a poly(3-hydraxybutyf ate-co-3- 
hydroxyhexanoate) random copolymer. 
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FIG. 1A 
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FIG.2 



M 



94 kDa 
67 kDa 

43 kDa 



3 0 kDa 



21.1 kDa ite^j 



14.4 kDa 



Lane M: molecular -weight marker 

Lane 1: soluble-protein fraction from NB3 

Lane 2 : active fraction eluted from the anion 
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